Regulator gene from Ustilago maydis

ABSTRACT

PCT No. PCT/EP96/04254; Sec. 371 Date Mar. 31, 1998; Sec. 102(e) Date Mar. 31, 1998; PCT Filed Sep. 30, 1996; PCT Pub. No. WO97/12911; PCT Pub. Date Apr. 10, 1997.

The invention relates to a regulatory nucleic acid fragment from Ustilago maydis and to its use.

The present invention relates to a regulatory gene from the fungus Ustilago maydis, to the use of this gene, and to fungal mutants which harbor a mutation in this gene.

Ustilago maydis, the organism causing blister smut of corn, has a two-phase life cycle. The haploid stage grows like a yeast and is not pathogenic. The dikaryon shows filamentous growth and represents the pathogenic form. The fungus has two mating type loci. The a locus, of which two alleles exist, is responsible for the fusion of haploid cells and the formation of the dikaryon. The multiallelic b locus controls the pathogenicity and the sexual development of the fungus. It codes for two homeodomain proteins (bW and bE) which form functional heterodimers.

A b locus-regulated gene, egl1, codes for an endoglucanase whose expression is induced in the filamentous phase. Expression of the glucanase egl1 can be detected reliably and unambiguously using indicator plates based on carboxymethylcellulose with congo red. Hence egl1 is suitable as a reporter gene in searches for genes which regulate the expression of differentially expressed genes. Since the filamentous phase of Ustilago maydis is pathogenic for corn, the object was to identify genes or gene products linked to the regulation of the filamentous phase. Such genes or gene products ought to offer suitable points of intervention for eliminating the pathogenicity.

We have now found a nucleic acid fragment from the fungus Ustilago maydis which contains a regulatory gene.

The invention relates to a nucleic acid fragment from the fungus Ustilago maydis which comprises the XbaI-BglII fragment depicted in FIG. 1 and which comprises the nucleic acid sequence indicated in FIG. 3 between the BglII and the XbaI cleavage site.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 depicts a nucleic acid fragment from the fungus Ustilago maydis which comprises the XbaI-BglII fragment.

FIG. 2 depicts a section of the region of cosmid pUMcos7-H2 which harbors the XbaI-BglII nucleic acid fragment.

FIG. 3 depicts the sequence of the nucleic acid fragment from the fungus Ustilago maydis between the XbaI and BglII cleavage sites.

FIG. 4 depicts the amino acid sequence of the gene product of the nucleic acid fragment of FIG. 3.

The nucleic acid fragment was isolated in the following way:

A haploid Ustilago maydis strain FB1 (a1 b1) (Banuett, F. and Herskowitz, I. (1989), Different a alleles of Ustilago maydis are necessary for maintenance of filamentous growth but not for meiosis, Proc. Natl. Acad. Sci. U.S.A. 86: 5878-5882), which does not express the endoglucanase egl1, was subjected to UV mutagenesis. Screening of the resulting mutants for constitutive egl1 expression on carboxymethylcellulose plates followed by congo red staining revealed one mutant with the property which was sought, i.e. filament-independent constitutive egl1 expression.

Detailed investigation of the resulting mutant revealed that other, normally differentially expressed, genes were expressed constitutively in this mutant. The gene affected by the mutation was further characterized by carrying out a complementation analysis (as described in detail in Example 3).

Analysis of the nucleic acid fragment revealed that the regulatory gene in this case presumably has a repressing action. This regulatory gene or the relevant gene product thus represents a possible specific site of action for fungicides. Candidate fungicidal compounds can readily be tested for interaction with this site of action by bringing a haploid Ustilago strain which does not express the endoglucanase egl1 into contact with the potential fungicide and determining whether egl1 expression takes place. If it does, interaction of the fungicide with the regulatory gene product can be assumed.

A test method of this type can be carried out particularly straightforwardly if the screening for egl1 expression is done using carboxymethylcellulose plates, which are stained with congo red.

The invention furthermore relates to homologous nucleic acid fragments from other microorganisms which are likewise capable of functional complementation of a constitutively expressing Ustilago maydis mutant. Nucleic acid fragments of this type can easily be prepared by conventional methods of genetic engineering such as hybridization, by starting from the Ustilago nucleic acid fragment as probe and isolating and functionally testing appropriate clones, which hybridize under standard conditions, from other organisms.

The invention further relates to

a) a gene product of the above-defined nucleic acid fragment which is represented by the amino acid sequence depicted in FIG. 4 (SEQ ID NO:2) and is encoded in particular by the nucleic acid sequence shown in FIG. 3, by the open reading frame starting with the ATG start codon at positions 1847-1849 and ending with the TAG stop codon at positions 8714-8716;

b) the use of the gene product as depicted in FIG. 4 or of part-sequences derived therefrom as target for fungicides.
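The coordinates quoted in a) can be checked for internal consistency by converting the span of the open reading frame into a codon count. The following is a minimal sketch in Python; the function name `orf_summary` is introduced here purely for illustration, and the sketch assumes that both coordinates are 1-based and inclusive, as is usual in sequence listings.

```python
def orf_summary(start_codon_first, stop_codon_last):
    """Return (nucleotides, codons, residues) for an open reading frame.

    start_codon_first: position of the first base of the start codon
    stop_codon_last:   position of the last base of the stop codon
    Both positions are assumed to be 1-based and inclusive.
    """
    nucleotides = stop_codon_last - start_codon_first + 1
    if nucleotides % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = nucleotides // 3
    residues = codons - 1  # the stop codon does not encode a residue
    return nucleotides, codons, residues


# Coordinates as quoted in the text: ATG at 1847-1849, TAG at 8714-8716.
nt, codons, residues = orf_summary(1847, 8716)
print(nt, codons, residues)
```

With the quoted coordinates the span is 6870 nucleotides, i.e. 2289 sense codons followed by the stop codon, so the reading frame is a whole number of codons.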

EXAMPLE 1 UV Mutation of the Ustilago maydis Strain FB1

The strain was inoculated in YEPS liquid medium (Tsukuda, T., Carleton, S., Fotheringham, S. and Holloman, W. K. (1988), Isolation and characterization of an autonomously replicating sequence from Ustilago maydis. Mol. Cell. Biol. 8: 3703-3709) and shaken at 28° C. The cultures were centrifuged at a cell count of 1×10⁶ to 3×10⁶ and resuspended in the same amount of double-distilled H₂O. 1 ml of this cell suspension was treated with UV in a Petri dish. The irradiation times were chosen so that the survival rates were below 1%. Aliquots of the UV-treated cell suspension were then plated out.
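The survival rate used to choose the irradiation time can be estimated by comparing colony counts from irradiated and non-irradiated aliquots. The sketch below is only an illustration of that arithmetic: the function names, the colony counts and the plated fractions are hypothetical and do not come from the text.

```python
def viable_titer(colonies, plated_fraction):
    """Estimate viable cells in the undiluted sample from one plate count.

    plated_fraction is the fraction of the undiluted suspension that ended up
    on the plate (e.g. 1e-2 for a 100-fold dilution of the plated volume).
    """
    if plated_fraction <= 0:
        raise ValueError("plated_fraction must be positive")
    return colonies / plated_fraction


def survival_percent(colonies_uv, fraction_uv, colonies_control, fraction_control):
    """Survival after UV treatment as a percentage of the untreated control."""
    treated = viable_titer(colonies_uv, fraction_uv)
    control = viable_titer(colonies_control, fraction_control)
    return 100.0 * treated / control


# Hypothetical counts, for illustration only.
pct = survival_percent(colonies_uv=120, fraction_uv=1e-2,
                       colonies_control=200, fraction_control=1e-4)
print(f"survival: {pct:.2f} %")
```

With these made-up counts the survival comes out at about 0.6%, which would satisfy the criterion of a survival rate below 1%.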

The screening for mutants took place on carboxymethylcellulose plates (0.5% yeast extract, 0.4% Bacto peptone, 0.4% sucrose, 2% carboxymethylcellulose, 1.5% Bitek agar).
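The plate composition above can be converted into weighed amounts for a given batch volume. This is a minimal sketch which assumes that all percentages are weight per volume (g per 100 ml); the text does not state this explicitly, and the names `CMC_PLATE_PERCENT` and `recipe_grams` are introduced here for illustration.

```python
# Composition of the carboxymethylcellulose test plates as listed in the text.
# Assumption: percentages are weight/volume, i.e. x % = x g per 100 ml.
CMC_PLATE_PERCENT = {
    "yeast extract": 0.5,
    "Bacto peptone": 0.4,
    "sucrose": 0.4,
    "carboxymethylcellulose": 2.0,
    "Bitek agar": 1.5,
}


def recipe_grams(volume_ml, composition=CMC_PLATE_PERCENT):
    """Return the grams of each component needed for volume_ml of medium."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    return {name: percent * volume_ml / 100.0
            for name, percent in composition.items()}


for name, grams in recipe_grams(1000).items():
    print(f"{name}: {grams:.1f} g")
```

Under the weight-per-volume assumption, one liter of medium would take 5 g yeast extract, 4 g each of peptone and sucrose, 20 g carboxymethylcellulose and 15 g agar.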

The colonies from the UV mutagenesis were replica plated on the test plates or, if the colony densities were too high, picked out and transferred, and incubated at 29° C. The cells were then washed off the plates for the test for egl1 expression. The plates were subsequently covered with 1% congo red for 20 minutes and decolorized with 1 M NaCl. The expression of egl1 is revealed by a pale halo.

EXAMPLE 2 Characterization of the Resulting Mutant

The mutant was crossed with a compatible wild-type strain (FB2 a2b2, Banuett and Herskowitz, 1989) on CM charcoal plates (Holliday, R. (1974) in: Handbook of Genetics, Vol. 1, R. C. King, ed. (New York: Plenum Press), pp. 575-595) in order to rule out mutations in the b locus. It emerged from this that the mutant showed the same crossing behavior as the starting strain.

RNA was prepared from liquid cultures of the mutant in YEPS (Schmitt, M. E., Brown, T. A., and Trumpower, B. L. (1990). A rapid and simple method for preparation of RNA from Saccharomyces cerevisiae. Nucl. Acids Res. 18, 3091-3092) and Northern analyses were carried out. Standard methods for blotting RNA onto nylon filters (Pall) were used (Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989). Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press)).

The probe used for the Northern blots was a HindIII-SacI fragment from the open reading frame of the egl1 gene (dissertation of Florian Schauwecker "Isolierung und Charakterisierung einer filamentspezifischen exprimierten Cellulase aus Ustilago maydis", Free University Berlin, 1995).

DNA was radiolabeled with ³²P using the Megaprime® labeling kit (Amersham). It was shown that egl1 is expressed in the mutant. The expression of other differentially expressed genes was also detectable in the mutant.

The resulting mutant is not pathogenic in the haploid state. When it is crossed with a compatible wild-type strain, infection of corn plants is followed by tumor formation and the formation of basidiospores. From the spores of such a cross it was possible to isolate, after segregation, a compatible strain which likewise harbors the mutation. Although crossing of this strain with the originally obtained mutant leads to tumor formation in the plant, spores are no longer formed in this cross.

A mutated Ustilago strain produced in this way can be used, for example, to induce the formation of tumors (galls) in corn plants without the black discoloration by spores.

Since the galls of the corn plants are used as food in some cases, infection with the described Ustilago mutant has the advantage, compared with the wild type, that the resulting galls are visually more attractive.

EXAMPLE 3 Complementation of the Mutant with an Ustilago Cosmid Bank

The mutant was complemented with a cosmid bank which had been prepared from genomic DNA of the diploid strain FBD11 (a1a2b1b2) (Banuett and Herskowitz, 1989, see above). To do this, partially cut MboI fragments were cloned into BamHI cleavage sites of the cosmid vector pUMcos. pUMcos is a modified pScos 1 vector (Stratagene; Wahl, G. M., Lewis, K. S., Ruiz, J. C., Rothenberg, B., Zhao, J., Evans, G. A. (1987) Cosmid vectors for rapid genomic walking, restriction mapping, and gene transfer. Proc. Natl. Acad. Sci. U.S.A. 84: 2160-2164). In pScos, a HindIII-SmaI fragment of the neomycin resistance gene has been replaced by an EcoRV-SmaI fragment which confers carboxin resistance in U. maydis (indicated by cbxR in FIG. 1).

The EcoRV-SmaI fragment is derived from pCBX122 (Keon, J. P. R., White, G. A., Hargreaves, J. A. (1991), Isolation, characterization and sequence of a gene conferring resistance to the systemic fungicide carboxin from the maize smut pathogen, Ustilago maydis. Curr. Genet. 19: 475-481).

The mutant was transformed with the cosmid bank. U. maydis was transformed by the method of Schulz, B., Banuett, F., Dahl, M., Schlesinger, R., Schafer, W., Martin, T., Herskowitz, I. and Kahmann, R. (1990) (The b alleles of U. maydis, whose combinations program pathogenic development, code for polypeptides containing a homeodomain-related motif. Cell 60: 295-306). Pools of 98 cosmids were transformed.

Screening for complemented transformants took place on carboxymethylcellulose plates. It was possible to identify one complemented transformant. Chromosomal DNA was isolated from a YEPS liquid culture of this strain by the method of Hoffman, C. S. and Winston, F. (1987) (A ten-minute DNA preparation from yeast efficiently releases autonomous plasmids for transformation of Escherichia coli. Gene 57: 267-272). 1 μg of DNA was cut with SalI and religated in a volume of 20 μl. The ligation mixture was transformed into E. coli DH5α. Transformation was carried out using a Biorad electroporation unit in accordance with the manufacturer's protocol. The plasmids were isolated from the transformants by a boiling miniprep (Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989). Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press)).

Digestion with EcoRI and SalI resulted in a fragment from the rescue plasmid which consists of U. maydis DNA. This was radiolabelled and used to screen the individual filters of the cosmid pool with which complementation was possible. The hybridization was carried out at 65° C.

The cosmid pUMcos7-H2 (see FIG. 1), which had been identified in this way, was transformed anew into the mutant. It was possible in this way to confirm the complementation.

EXAMPLE 4 Characterization of the Complementing Nucleic Acid Fragment

Restriction analysis of the cosmid was carried out. The cosmid was digested with BamHI, and the fragments were fractionated and isolated on a 0.8% agarose gel. The fragments were cloned into the integrative vector pHLN4. pHLN4 is derived from pHL1 (Wang, J., Holden, D. W., and Leong, S. A. (1988). Gene transfer system for the phytopathogenic fungus U. maydis, Proc. Natl. Acad. Sci. U.S.A. 85, 865-869). pHL1 is a pUC12 derivative which harbors the hygromycin resistance gene under the control of U. maydis hsp 70 regulatory sequences.

NotI linkers were cloned into the SacI site in the polylinker of pHL1. pHLN is described in: Schulz, B., Banuett, F., Dahl, M., Schlesinger, R., Schafer, W., Martin, T., Herskowitz, I., and Kahmann, R. (1990). The b alleles of U. maydis, whose combinations program pathogenic development, code for polypeptides containing a homeodomain-related motif. Cell 60, 295-306.

These plasmids were transformed into the mutant, and the transformants were tested for complementation. It was possible in this case to identify a 25 kb XbaI fragment which complements the mutation. The 25 kb XbaI fragment is indicated in FIG. 1 and is located between positions XbaI (32600) and XbaI (14000).

A BglII fragment about 10 kb in size, which is likewise complementing and is depicted in FIG. 1 between positions BglII (15000) and BglII (5100), was found in the same way.

As depicted in FIG. 1, the two complementing fragments have an overlapping region.

It was possible in this way to limit the complementing region to an XbaI-BglII fragment 8.9 kb in size. This fragment contains the genetic information necessary for complementation. It is depicted in FIG. 1 between positions XbaI (14000) and BglII (5100). The fragment also contains a BamHI site at position 10600 and a ClaI site at position 6900.
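The fragment sizes follow directly from the map positions quoted for FIG. 1. The sketch below reproduces that arithmetic in Python; the dictionary `SITES_BP` and the two helper functions are names introduced here for illustration, and because the map positions are rounded, the resulting sizes are approximate.

```python
# Map positions (bp) of the restriction sites on the complementing
# XbaI-BglII fragment, as quoted in the text for FIG. 1.
SITES_BP = {"BglII": 5100, "ClaI": 6900, "BamHI": 10600, "XbaI": 14000}


def fragment_kb(site_a, site_b, sites=SITES_BP):
    """Distance in kb between two mapped restriction sites."""
    return abs(sites[site_a] - sites[site_b]) / 1000.0


def sub_fragments_kb(left, right, cut_sites, sites=SITES_BP):
    """Sizes in kb of the pieces obtained when the left-right fragment
    is additionally cut at each site named in cut_sites."""
    positions = sorted([sites[left], sites[right]] + [sites[s] for s in cut_sites])
    return [(b - a) / 1000.0 for a, b in zip(positions, positions[1:])]


print(fragment_kb("XbaI", "BglII"))                   # whole fragment
print(sub_fragments_kb("BglII", "XbaI", ["BamHI"]))   # BglII-BamHI, BamHI-XbaI
```

The distance between XbaI (14000) and BglII (5100) comes to 8.9 kb, in agreement with the size stated above; cutting at the BamHI site (10600) divides the fragment into pieces of roughly 5.5 kb and 3.4 kb.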

EXAMPLE 5 Determination of the Nucleic Acid Sequence of the Complementing XbaI-BglII Nucleic Acid Fragment

A 5.4 kb BglII-BamHI fragment and a 3.4 kb BamHI-XbaI fragment were obtained from the XbaI-BglII fragment. Both fragments were cloned into the Bluescript vector (Stratagene). The nucleic acid sequences of the 5.4 kb BglII-BamHI insert of the clone p1709951#3 (FIG. 2) and of the 3.4 kb BamHI-XbaI insert of the clone p1710951#9 (FIG. 2) were determined by DNA sequencing (see FIG. 3). In addition, an 11 kb BglII fragment which comprises the region of the XbaI-BglII fragment was isolated from the cosmid pUMcos7-H2 and was cloned into the pSL 1180 plasmid vector (Pharmacia). The clone was called p2410951#1 (FIG. 2). DNA sequencing of the clone p2410951#1 confirmed the sequence of the BglII-XbaI fragment (FIG. 3). A gene product having the amino acid sequence indicated in FIG. 4 was identified by analyzing the open reading frames. The corresponding DNA sequence is depicted in SEQ ID NO:1, and the amino acid sequence of the gene product is depicted in SEQ ID NO:2.
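The relationship between the DNA sequence and the deduced gene product can be illustrated by translating the first codons of the open reading frame as printed in the sequence listing. The following is a minimal sketch, assuming the standard genetic code; the function `translate` and the table construction are illustrative and are not the analysis software used for the original work.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 19 codons of the open reading frame, copied from SEQ ID NO: 1
# (starting with the ATG at position 1847).
orf_start = ("ATG ACA TCC ACC AAG ACA AGC TCT CAG CCC TCA GGC TCG "
             "AGT GAC ACT CCT CAA CGC")
print(translate(orf_start))
```

The result, MTSTKTSSQPSGSSDTPQR in one-letter code, corresponds to the residues Met Thr Ser Thr Lys Thr Ser Ser Gln Pro Ser Gly Ser Ser Asp Thr Pro Gln Arg shown beneath these codons in the listing.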

    SEQUENCE LISTING

    (1) GENERAL INFORMATION:

       (iii) NUMBER OF SEQUENCES: 2

    (2) INFORMATION FOR SEQ ID NO: 1:

         (i) SEQUENCE CHARACTERISTICS:
              (A) LENGTH: 8931 base pairs
              (B) TYPE: Nucleic acid
              (C) STRANDEDNESS: double
              (D) TOPOLOGY: linear

        (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

    AGATCTACAC TCTCTTTACG ACCGTCTGTC AGTTAATGTT TCAAAGCCGA GCATCGTCCT    60
    GGATGCAGCT TGCTTTGCAG GCGTTAAAAC CACATTTTTT CGATTTCGCT CTTCTCTATC   120
    AGTTGTTTCT TGATTATATC TCGTGATACG TTGCATACCT TGTACACGGG TCGCAATTTA   180
    AGAGTTTCGT ATTCATGACA CTCGTGACTG ATCTTGGCTT GCTCAAAGTT GGTCGTGCTC   240
    TGAAGACAGT GGCGATGCTA ACAGACGATG CGGTATACAG GGAGACAAGG GTATGTCACG   300
    GTCGCTGGTT CGGAGATGTG TGAGCAGCAG GTGTGGACGC TAGCGATTCA GAGGTCGAAT   360
    GAGCTCCTTC GCAACAAGGC GAGGCAAAGC GTCCGTCAAG GCAGGGCATG CTGATGAGGA   420
    GCTGCCACGC GAGTCGCGCC GACAGCGAAG CACACATGAC GAGTCCCACG ACACGGTCGT   480
    GGCCCGACGT CTACCACCGG TGCTGGTGTG TGCTGTAGGA TGATTTGGAC AGGGAGAGGA   540
    AGGCACACGA CACGCGATCA GCTTGATAGC GAGCAACCCT TGTTCTGCTT GTTTTTGAAC   600
    AGAGGGTGCT GTCATACGAG CGGCCAGACG CTGGCAATGG CCCACCGAAT GTGATGGAGG   660
    AAAGCGCCAT GGCAAGGCGC AGGCTCAGGA ACGATAAGAT GGTTTGATTC CCTATTCGAC   720
    ACACGAGTTG GCTATTAGTC GTGAGCGTGA GTCGTGAGTG TGACTGGCGA AACACGACAG   780
    GTGCAGAGCT CGGCAACGAA CTTTCGACTT GCGCCCACTT TACGCGGAGG AGCCGGGCGA   840
    GAGACAGCGA GGATGCTTGA AAAATAAAAC AGGACAGGAA TAACAACAAG TCGGCCATGC   900
    TTGACAGTCG GTGCATTTGC AGTGTGTCTA ATTCTGCGAG GCATCACGCC TCCTGCCTTT   960
    TTTCCTCCAA ACCCACCCTT CCTGGCAGAA AGTGAGGCAG CAGAGAGAGG AAAAGAGAGA  1020
    AAGAGTCACG AGTGGGTGCC GCAATCACGA ATATATTAAT AATTCGAAAC CAAAACGCTT  1080
    TACAAGCATC AAGCACGAAG CTTAGGCCCT GCGTAATTCA CGATTCACGA TTCACGATTC  1140
    ACGATTGTGA CTTGGAGCCA AGGCTGTGGC CTCTGTCGCC TAACCCTTTT TTGACAGAGG  1200
    CTTCTGAGCT TGAGCGTGGG TGCCACAAAC TTGTCACGGC CTCCGAAATC AGAGGTCCTC  1260
    CTCATCATCT CAATTTTTTT TTGTGTTGGC GTGTCGTTCG CGCTAGCATC AAGACCATCA  1320
    CCACCATATC GATCACCGCG GCTGTGCTGC TGTCCGCCGC CCTGTCAGTA CTGCTCCACG  1380
    GTTCTCCTCA ATCTCCGCCT CGCTGTTGTG TTTGTCTTCC ACGGTGCTTC GCGCTCAGCT  1440
    GCTGTGCTTT GATCACACAT CGGGGTCGAT TCCTAACTTG TGTAAGCGCC TTATCCCACC  1500
    CAATTTGCCA AGTCGATACC GCCGTTCTAT CGCGGCTCTG GCCGGCGATT CTTGGCACTT  1560
    TCCCAACAGT TGCCTTGTCG GACTAGCCGC AATCTTGGCT CACAACGTGC AAACTATCTG  1620
    TCGAGATTCA TGACGGCGCT TCCCTAGTCA GGATCGGAAA ACGCCGCTCT CTCTGTGCCA  1680
    CATCGATTTC GCCTGCCACC AAAGGTGCAC CTCTCTGTCT CTCCTGAATA CGTCCGCCGC  1740
    TCATCTCAGC ATCCTTCTCC TCCGTTCTAT CTCCCTGCGC CTGCTGTTGT TCATCGATAT  1800

    ACACA CGTTGGCAGC TCCTCGATCT CTAACC ATG ACA TCC                     1855
                                       Met Thr Ser
                                       1

    ACC AAG ACA AGC TCT CAG CCC TCA GGC TCG AGT GAC ACT CCT CAA CGC    1903
    Thr Lys Thr Ser Ser Gln Pro Ser Gly Ser Ser Asp Thr Pro Gln Arg
                                                15

    ATC GTC GTC AAA TCC GTC AAT GGT CAC GAG CCC ATC AAA GTA GAG CCT    1951
    Ile Val Val Lys Ser Val Asn Gly His Glu Pro Ile Lys Val Glu Pro
                                                                35

    GTC TCC TCG TCC AGC GCA TCA CTG CTT CAC TCC ACA CCG CCC CGG CTC    1999
    Val Ser Ser Ser Ser Ala Ser Leu Leu His Ser Thr Pro Pro Arg Leu
                                                            50

    GCC ACA CCG CTT TCC TCA CCC ACC AAA TCA GCC GCC CCC TCG AGT CCA    2047
    Ala Thr Pro Leu Ser Ser Pro Thr Lys Ser Ala Ala Pro Ser Ser Pro
                                                        65

    TCC AAG TCT CCA GGA AGG GCG CGA CGT GTA GAT CCT GTC TTG ATC TCG    2095
    Ser Lys Ser Pro Gly Arg Ala Arg Arg Val Asp Pro Val Leu Ile Ser
                                                    80

    TCT CGC GAG TTT GGC CCT AGT GCT GGA GGA GAC AGC GAT GAC GAT GAG    2143
    Ser Arg Glu Phe Gly Pro Ser Ala Gly Gly Asp Ser Asp Asp Asp Glu
                                                95

    TTC AAC AAT GGC GAG CCC GAA GTT TAC AAG GGC GTC AAC ACC ACT GCC    2191
    Phe Asn Asn Gly Glu Pro Glu Val Tyr Lys Gly Val Asn Thr Thr Ala
    100                 105                 110                 115

    AAA AGG CTC AGC CGC AAG AGC AAG GCG GAT GCT ATG TTC GCC ATG TCC    2239
    Lys Arg Leu Ser Arg Lys Ser Lys Ala Asp Ala Met Phe Ala Met Ser
                                                            130

    GTC AAA GAG TCT TCT CCC GTC CAT GCG CAC GCA ACA TCC ACA TCT ACC    2287
    Val Lys Glu Ser Ser Pro Val His Ala His Ala Thr Ser Thr Ser Thr
                                                        145

    ACT TCA AAT GCT CCC ACT GCC ATC CCC GGC AAC CCT GCC TCC CAC CCA    2335
    Thr Ser Asn Ala Pro Thr Ala Ile Pro Gly Asn Pro Ala Ser His Pro
                                                    160

    GCA CGC AAA ATG TTC CAG CAC CAG CCA TTT CCG CCT CTG GTT TTC GAT    2383
    Ala Arg Lys Met Phe Gln His Gln Pro Phe Pro Pro Leu Val Phe Asp
                                                175

    ACA GAT CCC ATC TCA TCC ATC TCG CAG TCT CCA TCC GCT TCC AAT GCG    2431
    Thr Asp Pro Ile Ser Ser Ile Ser Gln Ser Pro Ser Ala Ser Asn Ala
    180                 185                 190                 195

    GCT CAG CCC CCC ATT CCA ACT CAT GCC AGC ACG CCA CGC TGC CCT CCG    2479
    Ala Gln Pro Pro Ile Pro Thr His Ala Ser Thr Pro Arg Cys Pro Pro
                                                            210

    CCC AGG CTC CGT CCC AGA CTC TTT GAG CTC GAC GAA GCG CCC ACT TTC    2527
    Pro Arg Leu Arg Pro Arg Leu Phe Glu Leu Asp Glu Ala Pro Thr Phe
                                                        225

    TAT CCA TCG CCT GAA GAG TTC TCT GAT CCA ATG AAG TAC ATC GCC TGG    2575
    Tyr Pro Ser Pro Glu Glu Phe Ser Asp Pro Met Lys Tyr Ile Ala Trp
                                                    240

    ATC GCC GAC CCA CAA GGT GGT AAT GGC AAG GCA TAC GGC ATC GTC AAG    2623
    Ile Ala Asp Pro Gln Gly Gly Asn Gly Lys Ala Tyr Gly Ile Val Lys
                                                255

    ATC GTT CCA CCT CAG GGC TGG AAC CCG GAA TGC GTG CTT GAT GAG CAG    2671
    Ile Val Pro Pro Gln Gly Trp Asn Pro Glu Cys Val Leu Asp Glu Gln
    260                 265                 270                 275

    ACC TTC CGC TTT CGC ACC CGC GTT CAG CTC CTC AAC TCG CTC AGT GCA    2719
    Thr Phe Arg Phe Arg Thr Arg Val Gln Leu Leu Asn Ser Leu Ser Ala
                                                            290

    GAT GCT CGG GCC TCT CAG AAC TAC CAG GAG CAA CTG CAA AAG TTC CAC    2767
    Asp Ala Arg Ala Ser Gln Asn Tyr Gln Glu Gln Leu Gln Lys Phe His
                                                        305

    GCG CAG CAG GGT CGC AAG CGT GTC TCG GTC CCC GTC ATT GAC GGT CGT    2815
    Ala Gln Gln Gly Arg Lys Arg Val Ser Val Pro Val Ile Asp Gly Arg
                                                    320

    TCC GTC GAT TTG TAC CAG CTC AAA CTA GTC ATC TCA AGT CTG GGT GGC    2863
    Ser Val Asp Leu Tyr Gln Leu Lys Leu Val Ile Ser Ser Leu Gly Gly
                                                335

    TAC GAT GCT GTT TGC CGT GCT CGC AAG TGG TCC GAT GCT ACG CGT AAG    2911
    Tyr Asp Ala Val Cys Arg Ala Arg Lys Trp Ser Asp Ala Thr Arg Lys
    340                 345                 350                 355

    ATC GGC TAC AGT GAC AAG GAA AGC GGT CAG CTC TCG ACG CAA GTC AAA    2959
    Ile Gly Tyr Ser Asp Lys Glu Ser Gly Gln Leu Ser Thr Gln Val Lys
                                                            370

    GCT GCC TAC ACC CGC ATC ATC TTG CCC TTT GAA GAG TTT CTT GCA AAA    3007
    Ala Ala Tyr Thr Arg Ile Ile Leu Pro Phe Glu Glu Phe Leu Ala Lys
                                                        385

    GCA AAA GAG CAG TCT CGT CCT AAC GGA TCA TCG GTC AGC CCA CAG CTC    3055
    Ala Lys Glu Gln Ser Arg Pro Asn Gly Ser Ser Val Ser Pro Gln Leu
                                                    400

    GCG CAG AGT GCC ATC ATG GGC GCC ACC GCC AGC ACG GAC ACC CAA GAG    3103
    Ala Gln Ser Ala Ile Met Gly Ala Thr Ala Ser Thr Asp Thr Gln Glu
                                                415

    AAT GGC GTT AAG CAC CCC TCC ATG TCG CCA AGC CTC GAC GCC GCC CCC    3151
    Asn Gly Val Lys His Pro Ser Met Ser Pro Ser Leu Asp Ala Ala Pro
    420                 425                 430                 435

    AGT GGA GAT GCA GGT GAA CAC TTC AAA ACA CCC GAG CCT TTC ACT GCT    3199
    Ser Gly Asp Ala Gly Glu His Phe Lys Thr Pro Glu Pro Phe Thr Ala
                                                            450

    GCT GGC GCT GCT GAG GCG CTC GCA AAT GCA ACT CCC GTC CTC GAG ACA    3247
    Ala Gly Ala Ala Glu Ala Leu Ala Asn Ala Thr Pro Val Leu Glu Thr
                                                        465

    CCC ACT CAA AGC CCT TCG ACT GTC GCA AGC ACA CGT CGC AGT GCG CGC    3295
    Pro Thr Gln Ser Pro Ser Thr Val Ala Ser Thr Arg Arg Ser Ala Arg
                                                    480

    AAG AGA TCG GAA GCA ACC AGC ACA CCT GCT TCG TCG TCG CGT AAC TCT    3343
    Lys Arg Ser Glu Ala Thr Ser Thr Pro Ala Ser Ser Ser Arg Asn Ser
                                                495

    TTG CAG CTC ACC TCC ACA CCA ATG ACA CCT TTG ATC TCC AGA CGC AGA    3391
    Leu Gln Leu Thr Ser Thr Pro Met Thr Pro Leu Ile Ser Arg Arg Arg
    500                 505                 510                 515

    AAG GGC GTT AGC CCT CAC CTT GAA GCA GAT TCT TAC CTG CGC GCT CAA    3439
    Lys Gly Val Ser Pro His Leu Glu Ala Asp Ser Tyr Leu Arg Ala Gln
                                                            530

    GCT GGC AAT CAG GCG CAA GAA GAG CAA ATG TGC GAA ATC TGC CTC CGA    3487
    Ala Gly Asn Gln Ala Gln Glu Glu Gln Met Cys Glu Ile Cys Leu Arg
                                                        545

    GGC GAG GAT GGT CCC AAC ATG TTG CTC TGC GAC GAG TGC AAT CGT GGC    3535
    Gly Glu Asp Gly Pro Asn Met Leu Leu Cys Asp Glu Cys Asn Arg Gly
                                                    560

    TAC CAC ATG TAC TGT CTC CAA CCC GCG CTC ACT TCG ATC CCC AAA TCG    3583
    Tyr His Met Tyr Cys Leu Gln Pro Ala Leu Thr Ser Ile Pro Lys Ser
                                                575

    CAG TGG TTC TGC CCG CCT TGT CTT GTC GGC ACC GGT CAT GAT TTT GGT    3631
    Gln Trp Phe Cys Pro Pro Cys Leu Val Gly Thr Gly His Asp Phe Gly
                                                                595

    TTT GAC GAT GGT GAA ACA CAC AGC CTC TAC ACT TTT TGG CAA CGT GCT    3679
    Phe Asp Asp Gly Glu Thr His Ser Leu Tyr Thr Phe Trp Gln Arg Ala
                                                            610

    GAG GCA TTC AAG CGC GAT TGG TGG TCC AAA CAT CAA GAT CAC CTC TGG    3727
    Glu Ala Phe Lys Arg Asp Trp Trp Ser Lys His Gln Asp His Leu Trp
                                                        625

    AGG CCC GAC TCG GAA GGC CTG GCG ACA TCT GAC TAC GAT CCG CCA ACG    3775
    Arg Pro Asp Ser Glu Gly Leu Ala Thr Ser Asp Tyr Asp Pro Pro Thr
                                                    640

    AAT GGT CTG GCT CGC CGT GTC CAC GGA ACC GAC CTG GTT GTG TCA GAG    3823
    Asn Gly Leu Ala Arg Arg Val His Gly Thr Asp Leu Val Val Ser Glu
                                                655

    GAC GAC GTA GAG CGC GAA TTT TGG AGA CTA GTT CAT AGC CAG AAG GAA    3871
    Asp Asp Val Glu Arg Glu Phe Trp Arg Leu Val His Ser Gln Lys Glu
                                                                675

    GAA GTA GAA GTC GAG TAT GGT GCT GAC GTT CAC TCT ACT ACG CAC GGC    3919
    Glu Val Glu Val Glu Tyr Gly Ala Asp Val His Ser Thr Thr His Gly
                                                            690

    AGT GCC TTG CCC ACC CAA GAG ACT CAT CCC TTG AGT CTG TAT TCG CGC
  3967                                                                           Ser Ala Leu Pro Thr Gln Glu Thr His Pro Le - #u Ser Leu Tyr Ser Arg            #            705                                                               - GAC AAG TGG AAC CTC AAT AAC CTA CCC ATC CT - #G CCT GGC TCG CTG CTC          4015                                                                           Asp Lys Trp Asn Leu Asn Asn Leu Pro Ile Le - #u Pro Gly Ser Leu Leu            #        720                                                                   - CAG TAC ATC AAG TCC GAC ATC TCG GGC ATG AC - #C GTC CCC TGG ATC TAT          4063                                                                           Gln Tyr Ile Lys Ser Asp Ile Ser Gly Met Th - #r Val Pro Trp Ile Tyr            #    735                                                                       - GTC GGA ATG ATT TTC TCC ACC TTC TGC TGG CA - #C AAC GAG GAT CAC TAC          4111                                                                           Val Gly Met Ile Phe Ser Thr Phe Cys Trp Hi - #s Asn Glu Asp His Tyr            #755                                                                           - ACT TAC TCG ATC AAC TAT CAG CAT TGG GGT GA - #G ACT AAG ACA TGG TAC          4159                                                                           Thr Tyr Ser Ile Asn Tyr Gln His Trp Gly Gl - #u Thr Lys Thr Trp Tyr            #                770                                                           - GGC ATT CCG GGT GAA GAT GCC GAA AAG TTC GA - #G AAT GCC ATG CGC AAG          4207                                                                           Gly Ile Pro Gly Glu Asp Ala Glu Lys Phe Gl - #u Asn Ala Met Arg Lys            #            785                                                               - GCG GCG CCC GAT TTA TTC GAG ACG CTG CCG GA - #C CTG CTC TTT CAT CTC          4255                                                                           Ala Ala Pro Asp Leu 
Phe Glu Thr Leu Pro As - #p Leu Leu Phe His Leu            #        800                                                                   - ACC ACC ATG ATG AGT CCC GAG AAG CTC AAG AA - #G GAA GGC GTC CGC GTT          4303                                                                           Thr Thr Met Met Ser Pro Glu Lys Leu Lys Ly - #s Glu Gly Val Arg Val            #    815                                                                       - GTG GCA TGT GAC CAA CGT GCC AAC GAG TTT GT - #C GTC ACT TTT CCC AAG          4351                                                                           Val Ala Cys Asp Gln Arg Ala Asn Glu Phe Va - #l Val Thr Phe Pro Lys            #835                                                                           - GCC TAC CAC AGC GGC TTT AAC CAC GGT CTC AA - #C CTG AAT GAA GCT GTC          4399                                                                           Ala Tyr His Ser Gly Phe Asn His Gly Leu As - #n Leu Asn Glu Ala Val            #                850                                                           - AAC TTT GCT CTG CCC GAC TGG ATC TTT GAC GA - #T CTC GAA TCT GTT CGG          4447                                                                           Asn Phe Ala Leu Pro Asp Trp Ile Phe Asp As - #p Leu Glu Ser Val Arg            #            865                                                               - AGG TAC CAG CGC TTC CGA AAG CCT GCC GTA TT - #C TCA CAC GAC CAG CTG          4495                                                                           Arg Tyr Gln Arg Phe Arg Lys Pro Ala Val Ph - #e Ser His Asp Gln Leu            #        880                                                                   - CTC ATT ACC GTC TCG CAG CAG AGT CAG ACC AT - #C GAA ACA GCC GTG TGG          4543                                                                           Leu Ile Thr Val Ser Gln Gln Ser Gln Thr Il - #e Glu Thr Ala Val Trp            #    895                                     
                                  - CTT GAG GCC GCC ATG CAA GAG ATG GTT GAT CG - #C GAG ATC GCA AAG CGC          4591                                                                           Leu Glu Ala Ala Met Gln Glu Met Val Asp Ar - #g Glu Ile Ala Lys Arg            900                 9 - #05                 9 - #10                 9 -        #15                                                                            - AAC GCA CTT CGT GAG ATC ATT CCG GAT CTC AA - #A GAA GAG GTA TAC GAC          4639                                                                           Asn Ala Leu Arg Glu Ile Ile Pro Asp Leu Ly - #s Glu Glu Val Tyr Asp            #               930                                                            - GAA GAT GTA GCC GAG AGC CAC TAC ATT TGC AG - #C CAC TGC ACT CTC TTT          4687                                                                           Glu Asp Val Ala Glu Ser His Tyr Ile Cys Se - #r His Cys Thr Leu Phe            #            945                                                               - TCC TAC CTC GGC CAG TTG ACA AGT CCA AAG AC - #C GAT GGT GTC GCT TGT          4735                                                                           Ser Tyr Leu Gly Gln Leu Thr Ser Pro Lys Th - #r Asp Gly Val Ala Cys            #        960                                                                   - CTC GAT CAC GGC TTC GAG GTG TGC AAC GCC GA - #T GCT CCC GTC AAG TGG          4783                                                                           Leu Asp His Gly Phe Glu Val Cys Asn Ala As - #p Ala Pro Val Lys Trp            #    975                                                                       - ACG TTG AAG CTT CGC TTC TCG GAC GAT CAG CT - #T CGC TCC ATT CTA GCG          4831                                                                           Thr Leu Lys Leu Arg Phe Ser Asp Asp Gln Le - #u Arg Ser Ile Leu Ala            #995                                                                  
         - AAG GTC TGT GAG CGG GCA GCA GTG CCG CGC AA - #C TGG ATT CAG CGC CTC          4879                                                                           Lys Val Cys Glu Arg Ala Ala Val Pro Arg As - #n Trp Ile Gln Arg Leu            #              10105                                                           - AAG AAG ACC CTT GCT CTT GGC CCG ACT CCA CC - #T CTC AAG ACG CTG AGG          4927                                                                           Lys Lys Thr Leu Ala Leu Gly Pro Thr Pro Pr - #o Leu Lys Thr Leu Arg            #          10250                                                               - TCG TTG CTG CAC GAA GGC GAA AAG ATT GCC TT - #C TCG CTA GAG CCG CTC          4975                                                                           Ser Leu Leu His Glu Gly Glu Lys Ile Ala Ph - #e Ser Leu Glu Pro Leu            #      10405                                                                   - GAG GAT CTC AGG ACC TTT GTC ACC TGC GCC AA - #C TCG TGG GTG GAG CGG          5023                                                                           Glu Asp Leu Arg Thr Phe Val Thr Cys Ala As - #n Ser Trp Val Glu Arg            #  10550                                                                       - GCC AAT GTT TTC CTG ATG CGC AAG TTG CAT AA - #G AGA CGC GGC GAG CCT          5071                                                                           Ala Asn Val Phe Leu Met Arg Lys Leu His Ly - #s Arg Arg Gly Glu Pro            #               10751065 - #                1070                               - GCA GCT GCT CCT GCT GGG AGG CGC CGA CGA TC - #C AAG GGC GGT GCT GTG          5119                                                                           Ala Ala Ala Pro Ala Gly Arg Arg Arg Arg Se - #r Lys Gly Gly Ala Val            #              10905                                                           - GCT GAC GAT AGC TTC ACT AGA AGG CAA AGC TT - #G GAC GCT TCG GTC GAC          5167            
                                                               Ala Asp Asp Ser Phe Thr Arg Arg Gln Ser Le - #u Asp Ala Ser Val Asp            #          11050                                                               - GAT GCC GAA TCC ACT TCC GAT CGA AGT CCC GA - #A GCC TTG TAT GCG TTG          5215                                                                           Asp Ala Glu Ser Thr Ser Asp Arg Ser Pro Gl - #u Ala Leu Tyr Ala Leu            #      11205                                                                   - ATC GGA GAG CTC GAC AGC CTT CAC TTT GAC GC - #G CCT GAG ATT GCA TCG          5263                                                                           Ile Gly Glu Leu Asp Ser Leu His Phe Asp Al - #a Pro Glu Ile Ala Ser            #  11350                                                                       - CTT CGC ACT ATG GCG CAA GAG CTC GAG GAG TT - #C ATT GGC CGG TGT GAC          5311                                                                           Leu Arg Thr Met Ala Gln Glu Leu Glu Glu Ph - #e Ile Gly Arg Cys Asp            #              11550  5                                                        - GAG GTC CTA CAA CAG GGT GAC GAG ACT AAT CT - #C AAA GAC TGT GAA AGC          5359                                                                           Glu Val Leu Gln Gln Gly Asp Glu Thr Asn Le - #u Lys Asp Cys Glu Ser            #              11705                                                           - ATC CTG ACG CTC GGC AGC TCT CTC AAT GTG GA - #C GCG CCT CAG ATC AAA          5407                                                                           Ile Leu Thr Leu Gly Ser Ser Leu Asn Val As - #p Ala Pro Gln Ile Lys            #          11850                                                               - GAG CTC TCC GAC TAT GTC GAG CGT CGC AAG TG - #G ATC CAG GAA GTC ACA          5455                                                                           Glu Leu Ser Asp Tyr Val Glu Arg Arg Lys 
Tr - #p Ile Gln Glu Val Thr            #      12005                                                                   - GAA TCG TTC GAC ACA TAT CTC TAT TAC CAC GA - #A GTT GCG GAA CTG TTG          5503                                                                           Glu Ser Phe Asp Thr Tyr Leu Tyr Tyr His Gl - #u Val Ala Glu Leu Leu            #  12150                                                                       - GAT CGC GCC GAC AGC TGT GGT CTA CAA GAT CA - #C GAG CTG CGC AAG AAT          5551                                                                           Asp Arg Ala Asp Ser Cys Gly Leu Gln Asp Hi - #s Glu Leu Arg Lys Asn            #              12350  5                                                        - CTT GAG CAG AGA CTC GAA GCC GGC CAA CGC TG - #G ACT GAA AGT GCA AGG          5599                                                                           Leu Glu Gln Arg Leu Glu Ala Gly Gln Arg Tr - #p Thr Glu Ser Ala Arg            #              12505                                                           - GAA GCG CTG GGA GGC TCT CAG CCT ATA ACA AT - #C GAC GTG CTT CAA GAG          5647                                                                           Glu Ala Leu Gly Gly Ser Gln Pro Ile Thr Il - #e Asp Val Leu Gln Glu            #          12650                                                               - CTT TCC GAG TCG TCA GCT GAT GTT CCT GTT GT - #G CTC GAA GTG GCT CAG          5695                                                                           Leu Ser Glu Ser Ser Ala Asp Val Pro Val Va - #l Leu Glu Val Ala Gln            #      12805                                                                   - GAT GTT ACC GAC GCT CTC TCC AAG GCC AAA GA - #G CTG CAA AAG ACC ATC          5743                                                                           Asp Val Thr Asp Ala Leu Ser Lys Ala Lys Gl - #u Leu Gln Lys Thr Ile            #  12950                                                         
              - CAG ACA CTG TAC AAG GCA TTA CAG ACG GGA GC - #T CAC GGC CAT TCT GCA          5791                                                                           Gln Thr Leu Tyr Lys Ala Leu Gln Thr Gly Al - #a His Gly His Ser Ala            #              13150  5                                                        - GCC GAT GCG GAT GGT GAC CTA TCA ATG ATC TC - #G ATC TCG GAA AAT GGC          5839                                                                           Ala Asp Ala Asp Gly Asp Leu Ser Met Ile Se - #r Ile Ser Glu Asn Gly            #              13305                                                           - GAA GCT GCC GAG CGT GTG GCT CTG CTT CCT GA - #C GCT CGT CGC GTG CTT          5887                                                                           Glu Ala Ala Glu Arg Val Ala Leu Leu Pro As - #p Ala Arg Arg Val Leu            #          13450                                                               - CGT GCC GCC AGG TCC AAC AAA CTG GAG CTT GA - #G CAC GCG CAA GAC ATT          5935                                                                           Arg Ala Ala Arg Ser Asn Lys Leu Glu Leu Gl - #u His Ala Gln Asp Ile            #      13605                                                                   - GAA AAG GCC GTC CAA GTC TAC GAT GCA TGG CG - #A GCT GCG TTC AAC CAG          5983                                                                           Glu Lys Ala Val Gln Val Tyr Asp Ala Trp Ar - #g Ala Ala Phe Asn Gln            #  13750                                                                       - ATC ATG CAG ACT ATT GCC GGT GGA TCT CGC CG - #C CTC ACG GAC GCA GAC          6031                                                                           Ile Met Gln Thr Ile Ala Gly Gly Ser Arg Ar - #g Leu Thr Asp Ala Asp            #               13951385 - #                1390                               - CGC GAC GAG GAG CTC GAC AAG CTG GTG GAG CG - #A GTC GAG GAT GCC ACC          6079       
                                                                    Arg Asp Glu Glu Leu Asp Lys Leu Val Glu Ar - #g Val Glu Asp Ala Thr            #              14105                                                           - GAC CCT GCC GAC GAC CAG AAC AAA CCC AAT GC - #A CGC AAC TGT ATC TGC          6127                                                                           Asp Pro Ala Asp Asp Gln Asn Lys Pro Asn Al - #a Arg Asn Cys Ile Cys            #          14250                                                               - AGG AGC TCA ATG CCC ATC GCC ATT CCT TCG TC - #G TCA GGG GCA GAA TGC          6175                                                                           Arg Ser Ser Met Pro Ile Ala Ile Pro Ser Se - #r Ser Gly Ala Glu Cys            #      14405                                                                   - TCT CGC TGT CGC GTG CAG TAC CAT CTA TCG TG - #C ATC AAG GTG CGC TCC          6223                                                                           Ser Arg Cys Arg Val Gln Tyr His Leu Ser Cy - #s Ile Lys Val Arg Ser            #  14550                                                                       - TCT GAG GTA TCA CGC GCC GAG GGC GGC TGG GT - #T TGT CCA TTC TGC CCG          6271                                                                           Ser Glu Val Ser Arg Ala Glu Gly Gly Trp Va - #l Cys Pro Phe Cys Pro            #               14751465 - #                1470                               - TGG TAC GGG AGC GCT CCG TTC CTC AAA ATG CG - #C AAG GCG ATC AGC ATT          6319                                                                           Trp Tyr Gly Ser Ala Pro Phe Leu Lys Met Ar - #g Lys Ala Ile Ser Ile            #              14905                                                           - GCT GAC CTT TCG AAG CTT GTA TAC GAT CAA GA - #T CAT CGT CGA GAC CAG          6367                                                                           Ala Asp Leu Ser Lys Leu Val Tyr Asp 
Gln As - #p His Arg Arg Asp Gln            #          15050                                                               - TTC AAA TTC CTC CCT CTG GAA TGG GAC GCC AT - #C GAG GAA GTG GTT GCC          6415                                                                           Phe Lys Phe Leu Pro Leu Glu Trp Asp Ala Il - #e Glu Glu Val Val Ala            #      15205                                                                   - AAG GCA AAG CGA TTC GAG ACG GCC GCT AAG CG - #A ATG ATC AAA ACA CTT          6463                                                                           Lys Ala Lys Arg Phe Glu Thr Ala Ala Lys Ar - #g Met Ile Lys Thr Leu            #  15350                                                                       - TCG CTG ATG CGC AGA GAT CAA AAG CAG GTC AT - #C CTT GCC CAC TGG CTA          6511                                                                           Ser Leu Met Arg Arg Asp Gln Lys Gln Val Il - #e Leu Ala His Trp Leu            #              15550  5                                                        - CGT CGG TCC ATT GGC TGC CCC GTC GAT GTC TT - #G GGA CCA GAG AAA GTC          6559                                                                           Arg Arg Ser Ile Gly Cys Pro Val Asp Val Le - #u Gly Pro Glu Lys Val            #              15705                                                           - AAC ATG CTT GAC CTC ATC AGC GAA AAT TTG CT - #C GCC CTT GGT TCA CAG          6607                                                                           Asn Met Leu Asp Leu Ile Ser Glu Asn Leu Le - #u Ala Leu Gly Ser Gln            #          15850                                                               - CAG GGT GAT GCT GCA CCC ATG GCG CCT GTT GA - #G CGT ATC AAG GCG TCG          6655                                                                           Gln Gly Asp Ala Ala Pro Met Ala Pro Val Gl - #u Arg Ile Lys Ala Ser            #      16005                                                 
                  - ACT CCA GCG CGA TCC GAC GAG CGC ACG GAA GA - #A ACA ACG CCC TTG CCT          6703                                                                           Thr Pro Ala Arg Ser Asp Glu Arg Thr Glu Gl - #u Thr Thr Pro Leu Pro            #  16150                                                                       - CGC TCG TCT CGC GTT CCA GCC CCT GCC GAT CG - #C GAC TCA GGA TCT CCA          6751                                                                           Arg Ser Ser Arg Val Pro Ala Pro Ala Asp Ar - #g Asp Ser Gly Ser Pro            #               16351625 - #                1630                               - GCT GTC CGA GAC GAT CGC AAG CGC AAA GCC AA - #G AGA GGC AAG CGT GCC          6799                                                                           Ala Val Arg Asp Asp Arg Lys Arg Lys Ala Ly - #s Arg Gly Lys Arg Ala            #              16505                                                           - AAG CTC GTC TTC CAG GAG GAG ATT GGT ATC GG - #T GCT TAC CGC GAT CGT          6847                                                                           Lys Leu Val Phe Gln Glu Glu Ile Gly Ile Gl - #y Ala Tyr Arg Asp Arg            #          16650                                                               - CAG CCC ATC TAC TGT CTG TGC CAT GAG CCA GA - #G AGC GGT CGC ATG ATT          6895                                                                           Gln Pro Ile Tyr Cys Leu Cys His Glu Pro Gl - #u Ser Gly Arg Met Ile            #      16805                                                                   - GCT TGT GAC AAG TGC ATG CTC TGG TTT CAT AC - #C AAT TGT GTT CGC CTC          6943                                                                           Ala Cys Asp Lys Cys Met Leu Trp Phe His Th - #r Asn Cys Val Arg Leu            #  16950                                                                       - GAT GAT CCG CCG AAT CTC GGA AAT GAG CCG TG - #G ATA TGT CCC ATG TGC          6991   
                                                                        Asp Asp Pro Pro Asn Leu Gly Asn Glu Pro Tr - #p Ile Cys Pro Met Cys            #               17151705 - #                1710                               - TGC ATC AAG GCG GAG CGC AAG TAT CCT CAG GC - #C GAA GTC AGG GTC AAA          7039                                                                           Cys Ile Lys Ala Glu Arg Lys Tyr Pro Gln Al - #a Glu Val Arg Val Lys            #              17305                                                           - GAC ATT GGC GTC ACC GAC CCG GAT CTG TGG CT - #C GAC ATC CGT GCC ACG          7087                                                                           Asp Ile Gly Val Thr Asp Pro Asp Leu Trp Le - #u Asp Ile Arg Ala Thr            #          17450                                                               - CTG CGA TCG CTC GAG AAG CCT GTC AGC AAG AT - #T CAG TCG TGG ACC AGC          7135                                                                           Leu Arg Ser Leu Glu Lys Pro Val Ser Lys Il - #e Gln Ser Trp Thr Ser            #      17605                                                                   - CCG GAG AAC AAG CGC ATT GTG CTA CAT CTG GA - #A AAG TTC ACA CCG GCT          7183                                                                           Pro Glu Asn Lys Arg Ile Val Leu His Leu Gl - #u Lys Phe Thr Pro Ala            #  17750                                                                       - ATC CAT GCT GAG GAG GTG CAC TCG CAG ATC AC - #C AAA CGT GCG CGT CTC          7231                                                                           Ile His Ala Glu Glu Val His Ser Gln Ile Th - #r Lys Arg Ala Arg Leu            #               17951785 - #                1790                               - GAG TCC GAC ACG CCG AGC AAG GCG CGA GTG TC - #T CTG GGC CGC TCT GAT          7279                                                                           Glu Ser Asp Thr Pro Ser Lys Ala 
Arg Val Se - #r Leu Gly Arg Ser Asp            #              18105                                                           - TCG ATC TCG ACG CCA GCA AAG GAG AGC GGC GC - #C GTT CCT TAT GCG GCA          7327                                                                           Ser Ile Ser Thr Pro Ala Lys Glu Ser Gly Al - #a Val Pro Tyr Ala Ala            #          18250                                                               - GCT CCT GTG CCC AGC GAG GCT GTT CGA GGT AT - #C GTG CCT GCT TTG ACG          7375                                                                           Ala Pro Val Pro Ser Glu Ala Val Arg Gly Il - #e Val Pro Ala Leu Thr            #      18405                                                                   - CCG GCG GCT GAT TCA CCC GCC TCC AGA TCA GG - #A AGG AAC GAC GAT TCA          7423                                                                           Pro Ala Ala Asp Ser Pro Ala Ser Arg Ser Gl - #y Arg Asn Asp Asp Ser            #  18550                                                                       - TTT GCT GCA GCC TCG CCT CCT TTG TGG GAT GC - #C AAG ACT GGA CCA TCT          7471                                                                           Phe Ala Ala Ala Ser Pro Pro Leu Trp Asp Al - #a Lys Thr Gly Pro Ser            #              18750  5                                                        - CCT GGC AAC GCC AGC ATC GAA TGG GCG CAG TC - #G GCA CGT CGA CGA TAT          7519                                                                           Pro Gly Asn Ala Ser Ile Glu Trp Ala Gln Se - #r Ala Arg Arg Arg Tyr            #              18905                                                           - GCC GAA GGC ATG GAC AAC CTC TAC CGT CGC GG - #C ATC ACG GAC ACG ATG          7567                                                                           Ala Glu Gly Met Asp Asn Leu Tyr Arg Arg Gl - #y Ile Thr Asp Thr Met            #          19050                                         
                      - CTG GTG CGA TTC TAC GTT GGA TGG AAT GGA CG - #T ACG CTC TTT CAT CCG          7615                                                                           Leu Val Arg Phe Tyr Val Gly Trp Asn Gly Ar - #g Thr Leu Phe His Pro            #      19205                                                                   - GTA CGA GAC TCA GCG GGC AAC ATT GTA GAG GT - #A TCT CTG GGC GAG AAC          7663                                                                           Val Arg Asp Ser Ala Gly Asn Ile Val Glu Va - #l Ser Leu Gly Glu Asn            #  19350                                                                       - GTC CGT CTG CAT CCA GAT GAT CCC GAG GGC GT - #G CGG GTA ATT CGT GCT          7711                                                                           Val Arg Leu His Pro Asp Asp Pro Glu Gly Va - #l Arg Val Ile Arg Ala            #              19550  5                                                        - GCC ATT GAA CGA CAC AGC GTC AAA GCG GAC CG - #T TTA GCC GCA AGT CAT          7759                                                                           Ala Ile Glu Arg His Ser Val Lys Ala Asp Ar - #g Leu Ala Ala Ser His            #              19705                                                           - GGC TAT GGC GGC GAG ATG GAC GAT CAT GTG TA - #C TCT CGC AAC GCT TAC          7807                                                                           Gly Tyr Gly Gly Glu Met Asp Asp His Val Ty - #r Ser Arg Asn Ala Tyr            #          19850                                                               - AGT CGC GAC GAC GGA CGC TAT ACA GCT CAG CG - #A CGC GAT CCT CCG GTG          7855                                                                           Ser Arg Asp Asp Gly Arg Tyr Thr Ala Gln Ar - #g Arg Asp Pro Pro Val            #      20005                                                                   - GTA CCG TCG AAT GGC AGA TTC AGC ATG AGA TC - #G CCT GCC ACG ATT CCT          
7903                                                                           Val Pro Ser Asn Gly Arg Phe Ser Met Arg Se - #r Pro Ala Thr Ile Pro            #  20150                                                                       - TCG CAA CGA CTT GGC AGC GAT CGC GAC TAT GA - #A CGC GAG CGG GAG CGT          7951                                                                           Ser Gln Arg Leu Gly Ser Asp Arg Asp Tyr Gl - #u Arg Glu Arg Glu Arg            #              20350  5                                                        - GAC GGG GAT CTT CAT GAT GCC CGT GAT GGT CG - #T GAT GGC CGA TAT GGC          7999                                                                           Asp Gly Asp Leu His Asp Ala Arg Asp Gly Ar - #g Asp Gly Arg Tyr Gly            #              20505                                                           - GAT TCA TTA CGT TCT CCG GCG GCA CCA GTG GC - #G GCG ATG ACT GCC CCT          8047                                                                           Asp Ser Leu Arg Ser Pro Ala Ala Pro Val Al - #a Ala Met Thr Ala Pro            #          20650                                                               - GGT gca ttg gac acc tcg ccg gcg ctc cga ac - #g aat cta gcg cgc gaa          8095                                                                           Gly Ala Leu Asp Thr Ser Pro Ala Leu Arg Th - #r Asn Leu Ala Arg Glu            #      20805                                                                   - GTC GTG CCG ACA TAC GCG CGA AGC TCA GCT AA - #T GCA TCG GCA ACC ACA          8143                                                                           Val Val Pro Thr Tyr Ala Arg Ser Ser Ala As - #n Ala Ser Ala Thr Thr            #  20950                                                                       - AGT CCA TAC ACT GGC GCT GCT TCG ACG TAC AG - #C ATT TAT TCG GCA TCT          8191                                                                           Ser Pro Tyr Thr Gly Ala 
Ala Ser Thr Tyr Ser Ile Tyr Ser Ala Ser

GAC AGA GCG GCA TCT TAT CCG GTG GGT CGC AGT TCG ATT TCG CAG GCG   8239
Asp Arg Ala Ala Ser Tyr Pro Val Gly Arg Ser Ser Ile Ser Gln Ala

GAT CTG GAT GGA AAT AGG GGG GGA CCT CCA CCG ATG GCG ATG TAT GCT   8287
Asp Leu Asp Gly Asn Arg Gly Gly Pro Pro Pro Met Ala Met Tyr Ala

TCT GCC AAG GCT GAG CCT GTC GCA AAT GGG TCT ACG TTT TCG GCA CTG   8335
Ser Ala Lys Ala Glu Pro Val Ala Asn Gly Ser Thr Phe Ser Ala Leu

GAC CCA GCG ATG ATG GCA GAC GAT GCA GCA GGA CAG ATC GAT CCC AAT   8383
Asp Pro Ala Met Met Ala Asp Asp Ala Ala Gly Gln Ile Asp Pro Asn

TTG ACG AGC AGT CCG GTT CTA GCT TCC AAC TCG GCA GTT CCC GCA CCG   8431
Leu Thr Ser Ser Pro Val Leu Ala Ser Asn Ser Ala Val Pro Ala Pro

TCG ACC GCA CCG GCA GCA GCA CAT GGT GTT CGG AGC GAG ACG AGG AGC   8479
Ser Thr Ala Pro Ala Ala Ala His Gly Val Arg Ser Glu Thr Arg Ser

CGT CCA CCC AGC GCA GGC AAC GAA GTC GCC CAT GAA GCC GGT TCC GCG   8527
Arg Pro Pro Ser Ala Gly Asn Glu Val Ala His Glu Ala Gly Ser Ala

AAA GCA CCC CCG GGT GCA CCC TCG GGT GGC CAC AGT GGC GAG ATC AAG   8575
Lys Ala Pro Pro Gly Ala Pro Ser Gly Gly His Ser Gly Glu Ile Lys

GAG CAC AAC CCA GAC GAG CAC GAG CTC GAG AGT GTT CGT CAG CAG GCT   8623
Glu His Asn Pro Asp Glu His Glu Leu Glu Ser Val Arg Gln Gln Ala

AGA CAG ATG GCG CGG AAA ATG CGA CCA GAC GCT TCC GAG GCC GAC ATC   8671
Arg Gln Met Ala Arg Lys Met Arg Pro Asp Ala Ser Glu Ala Asp Ile

GAA CGA TTG GTT CAA AAC TTT ATC GGT GGT GGA GAG TCT AAG TAG       8716
Glu Arg Leu Val Gln Asn Phe Ile Gly Gly Gly Glu Ser Lys

CGCGCCCTGC CAAGAATACA TGCGGTTCAA TGAAATTGTG AATCAAGAAT CATGAATCGT   8776
GAATGTACAA TCGATATCAC ACCACGCAGC ACGAATAGCG AGATTCACGA TTCACGAATC   8836
GTGATTCGTG AATCACGAAT GTGCGAACGA AAATCAGGGT TTGGATTCCA AGAGAAAGAA   8896
TGAG TCAAACGAGT CTAGA                                               8931

(2) INFORMATION FOR SEQ ID NO: 2:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 2289 amino acids
          (B) TYPE: amino acid
          (D) TOPOLOGY: linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Ser Thr Lys Thr Ser Ser Gln Pro Ser Gly Ser Ser Asp Thr
Pro Gln Arg Ile Val Val Lys Ser Val Asn Gly His Glu Pro Ile Lys
Val Glu Pro Val Ser Ser Ser Ser Ala Ser Leu Leu His Ser Thr Pro
Pro Arg Leu Ala Thr Pro Leu Ser Ser Pro Thr Lys Ser Ala Ala Pro
Ser Ser Pro Ser Lys Ser Pro Gly Arg Ala Arg Arg Val Asp Pro Val
Leu Ile Ser Ser Arg Glu Phe Gly Pro Ser Ala Gly Gly Asp Ser Asp
Asp Asp Glu Phe Asn Asn Gly Glu Pro Glu Val Tyr Lys Gly Val Asn
Thr Thr Ala Lys Arg Leu Ser Arg Lys Ser Lys Ala Asp Ala Met Phe
Ala Met Ser Val Lys Glu Ser Ser Pro Val His Ala His Ala Thr Ser
Thr Ser Thr Thr Ser Asn Ala Pro Thr Ala Ile Pro Gly Asn Pro Ala
Ser His Pro Ala Arg Lys Met Phe Gln His Gln Pro Phe Pro Pro Leu
Val Phe Asp Thr Asp Pro Ile Ser Ser Ile Ser Gln Ser Pro Ser Ala
Ser Asn Ala Ala Gln Pro Pro Ile Pro Thr His Ala Ser Thr Pro Arg
Cys Pro Pro Pro Arg Leu Arg Pro Arg Leu Phe Glu Leu Asp Glu Ala
Pro Thr Phe Tyr Pro Ser Pro Glu Glu Phe Ser Asp Pro Met Lys Tyr
Ile Ala Trp Ile Ala Asp Pro Gln Gly Gly Asn Gly Lys Ala Tyr Gly
Ile Val Lys Ile Val Pro Pro Gln Gly Trp Asn Pro Glu Cys Val Leu
Asp Glu Gln Thr Phe Arg Phe Arg Thr Arg Val Gln Leu Leu Asn Ser
Leu Ser Ala Asp Ala Arg Ala Ser Gln Asn Tyr Gln Glu Gln Leu Gln
Lys Phe His Ala Gln Gln Gly Arg Lys Arg Val Ser Val Pro Val Ile
Asp Gly Arg Ser Val Asp Leu Tyr Gln Leu Lys Leu Val Ile Ser Ser
Leu Gly Gly Tyr Asp Ala Val Cys Arg Ala Arg Lys Trp Ser Asp Ala
Thr Arg Lys Ile Gly Tyr Ser Asp Lys Glu Ser Gly Gln Leu Ser Thr
Gln Val Lys Ala Ala Tyr Thr Arg Ile Ile Leu Pro Phe Glu Glu Phe
Leu Ala Lys Ala Lys Glu Gln Ser Arg Pro Asn Gly Ser Ser Val Ser
Pro Gln Leu Ala Gln Ser Ala Ile Met Gly Ala Thr Ala Ser Thr Asp
Thr Gln Glu Asn Gly Val Lys His Pro Ser Met Ser Pro Ser Leu Asp
Ala Ala Pro Ser Gly Asp Ala Gly Glu His Phe Lys Thr Pro Glu Pro
Phe Thr Ala Ala Gly Ala Ala Glu Ala Leu Ala Asn Ala Thr Pro Val
Leu Glu Thr Pro Thr Gln Ser Pro Ser Thr Val Ala Ser Thr Arg Arg
Ser Ala Arg Lys Arg Ser Glu Ala Thr Ser Thr Pro Ala Ser Ser Ser
Arg Asn Ser Leu Gln Leu Thr Ser Thr Pro Met Thr Pro Leu Ile Ser
Arg Arg Arg Lys Gly Val Ser Pro His Leu Glu Ala Asp Ser Tyr Leu
Arg Ala Gln Ala Gly Asn Gln Ala Gln Glu Glu Gln Met Cys Glu Ile
Cys Leu Arg Gly Glu Asp Gly Pro Asn Met Leu Leu Cys Asp Glu Cys
Asn Arg Gly Tyr His Met Tyr Cys Leu Gln Pro Ala Leu Thr Ser Ile
Pro Lys Ser Gln Trp Phe Cys Pro Pro Cys Leu Val Gly Thr Gly His
Asp Phe Gly Phe Asp Asp Gly Glu Thr His Ser Leu Tyr Thr Phe Trp
Gln Arg Ala Glu Ala Phe Lys Arg Asp Trp Trp Ser Lys His Gln Asp
His Leu Trp Arg Pro Asp Ser Glu Gly Leu Ala Thr Ser Asp Tyr Asp
Pro Pro Thr Asn Gly Leu Ala Arg Arg Val His Gly Thr Asp Leu Val
Val Ser Glu Asp Asp Val Glu Arg Glu Phe Trp Arg Leu Val His Ser
Gln Lys Glu Glu Val Glu Val Glu Tyr Gly Ala Asp Val His Ser Thr
Thr His Gly Ser Ala Leu Pro Thr Gln Glu Thr His Pro Leu Ser Leu
Tyr Ser Arg Asp Lys Trp Asn Leu Asn Asn Leu Pro Ile Leu Pro Gly
Ser Leu Leu Gln Tyr Ile Lys Ser Asp Ile Ser Gly Met Thr Val Pro
Trp Ile Tyr Val Gly Met Ile Phe Ser Thr Phe Cys Trp His Asn Glu
Asp His Tyr Thr Tyr Ser Ile Asn Tyr Gln His Trp Gly Glu Thr Lys
Thr Trp Tyr Gly Ile Pro Gly Glu Asp Ala Glu Lys Phe Glu Asn Ala
Met Arg Lys Ala Ala Pro Asp Leu Phe Glu Thr Leu Pro Asp Leu Leu
Phe His Leu Thr Thr Met Met Ser Pro Glu Lys Leu Lys Lys Glu Gly
Val Arg Val Val Ala Cys Asp Gln Arg Ala Asn Glu Phe Val Val Thr
Phe Pro Lys Ala Tyr His Ser Gly Phe Asn His Gly Leu Asn Leu Asn
Glu Ala Val Asn Phe Ala Leu Pro Asp Trp Ile Phe Asp Asp Leu Glu
Ser Val Arg Arg Tyr Gln Arg Phe Arg Lys Pro Ala Val Phe Ser His
Asp Gln Leu Leu Ile Thr Val Ser Gln Gln Ser Gln Thr Ile Glu Thr
Ala Val Trp Leu Glu Ala Ala Met Gln Glu Met Val Asp Arg Glu Ile
Ala Lys Arg Asn Ala Leu Arg Glu Ile Ile Pro Asp Leu Lys Glu Glu
Val Tyr Asp Glu Asp Val Ala Glu Ser His Tyr Ile Cys Ser His Cys
Thr Leu Phe Ser Tyr Leu Gly Gln Leu Thr Ser Pro Lys Thr Asp Gly
Val Ala Cys Leu Asp His Gly Phe Glu Val Cys Asn Ala Asp Ala Pro
Val Lys Trp Thr Leu Lys Leu Arg Phe Ser Asp Asp Gln Leu Arg Ser
Ile Leu Ala Lys Val Cys Glu Arg Ala Ala Val Pro Arg Asn Trp Ile
Gln Arg Leu Lys Lys Thr Leu Ala Leu Gly Pro Thr Pro Pro Leu Lys
Thr Leu Arg Ser Leu Leu His Glu Gly Glu Lys Ile Ala Phe Ser Leu
Glu Pro Leu Glu Asp Leu Arg Thr Phe Val Thr Cys Ala Asn Ser Trp
Val Glu Arg Ala Asn Val Phe Leu Met Arg Lys Leu His Lys Arg Arg
Gly Glu Pro Ala Ala Ala Pro Ala Gly Arg Arg Arg Arg Ser Lys Gly
Gly Ala Val Ala Asp Asp Ser Phe Thr Arg Arg Gln Ser Leu Asp Ala
Ser Val Asp Asp Ala Glu Ser Thr Ser Asp Arg Ser Pro Glu Ala Leu
Tyr Ala Leu Ile Gly Glu Leu Asp Ser Leu His Phe Asp Ala Pro Glu
Ile Ala Ser Leu Arg Thr Met Ala Gln Glu Leu Glu Glu Phe Ile Gly
Arg Cys Asp Glu Val Leu Gln Gln Gly Asp Glu Thr Asn Leu Lys Asp
Cys Glu Ser Ile Leu Thr Leu Gly Ser Ser Leu Asn Val Asp Ala Pro
Gln Ile Lys Glu Leu Ser Asp Tyr Val Glu Arg Arg Lys Trp Ile Gln
Glu Val Thr Glu Ser Phe Asp Thr Tyr Leu Tyr Tyr His Glu Val Ala
Glu Leu Leu Asp Arg Ala Asp Ser Cys Gly Leu Gln Asp His Glu Leu
Arg Lys Asn Leu Glu Gln Arg Leu Glu Ala Gly Gln Arg Trp Thr Glu
Ser Ala Arg Glu Ala Leu Gly Gly Ser Gln Pro Ile Thr Ile Asp Val
Leu Gln Glu Leu Ser Glu Ser Ser Ala Asp Val Pro Val Val Leu Glu
Val Ala Gln Asp Val Thr Asp Ala Leu Ser Lys Ala Lys Glu Leu Gln
Lys Thr Ile Gln Thr Leu Tyr Lys Ala Leu Gln Thr Gly Ala His Gly
His Ser Ala Ala Asp Ala Asp Gly Asp Leu Ser Met Ile Ser Ile Ser
Glu Asn Gly Glu Ala Ala Glu Arg Val Ala Leu Leu Pro Asp Ala Arg
Arg Val Leu Arg Ala Ala Arg Ser Asn Lys Leu Glu Leu Glu His Ala
Gln Asp Ile Glu Lys Ala Val Gln Val Tyr Asp Ala Trp Arg Ala Ala
Phe Asn Gln Ile Met Gln Thr Ile Ala Gly Gly Ser Arg Arg Leu Thr
Asp Ala Asp Arg Asp Glu Glu Leu Asp Lys Leu Val Glu Arg Val Glu
Asp Ala Thr Asp Pro Ala Asp Asp Gln Asn Lys Pro Asn Ala Arg Asn
Cys Ile Cys Arg Ser Ser Met Pro Ile Ala Ile Pro Ser Ser Ser Gly
Ala Glu Cys Ser Arg Cys Arg Val Gln Tyr His Leu Ser Cys Ile Lys
Val Arg Ser Ser Glu Val Ser Arg Ala Glu Gly Gly Trp Val Cys Pro
Phe Cys Pro Trp Tyr Gly Ser Ala Pro Phe Leu Lys Met Arg Lys Ala
Ile Ser Ile Ala Asp Leu Ser Lys Leu Val Tyr Asp Gln Asp His Arg
Arg Asp Gln Phe Lys Phe Leu Pro Leu Glu Trp Asp Ala Ile Glu Glu
Val Val Ala Lys Ala Lys Arg Phe Glu Thr Ala Ala Lys Arg Met Ile
Lys Thr Leu Ser Leu Met Arg Arg Asp Gln Lys Gln Val Ile Leu Ala
His Trp Leu Arg Arg Ser Ile Gly Cys Pro Val Asp Val Leu Gly Pro
Glu Lys Val Asn Met Leu Asp Leu Ile Ser Glu Asn Leu Leu Ala Leu
Gly Ser Gln Gln Gly Asp Ala Ala Pro Met Ala Pro Val Glu Arg Ile
Lys Ala Ser Thr Pro Ala Arg Ser Asp Glu Arg Thr Glu Glu Thr Thr
Pro Leu Pro Arg Ser Ser Arg Val Pro Ala Pro Ala Asp Arg Asp Ser
Gly Ser Pro Ala Val Arg Asp Asp Arg Lys Arg Lys Ala Lys Arg Gly
Lys Arg Ala Lys Leu Val Phe Gln Glu Glu Ile Gly Ile Gly Ala Tyr
Arg Asp Arg Gln Pro Ile Tyr Cys Leu Cys His Glu Pro Glu Ser Gly
Arg Met Ile Ala Cys Asp Lys Cys Met Leu Trp Phe His Thr Asn Cys
Val Arg Leu Asp Asp Pro Pro Asn Leu Gly Asn Glu Pro Trp Ile Cys
Pro Met Cys Cys Ile Lys Ala Glu Arg Lys Tyr Pro Gln Ala Glu Val
Arg Val Lys Asp Ile Gly Val Thr Asp Pro Asp Leu Trp Leu Asp Ile
Arg Ala Thr Leu Arg Ser Leu Glu Lys Pro Val Ser Lys Ile Gln Ser
Trp Thr Ser Pro Glu Asn Lys Arg Ile Val Leu His Leu Glu Lys Phe
Thr Pro Ala Ile His Ala Glu Glu Val His Ser Gln Ile Thr Lys Arg
Ala Arg Leu Glu Ser Asp Thr Pro Ser Lys Ala Arg Val Ser Leu Gly
Arg Ser Asp Ser Ile Ser Thr Pro Ala Lys Glu Ser Gly Ala Val Pro
Tyr Ala Ala Ala Pro Val Pro Ser Glu Ala Val Arg Gly Ile Val Pro
Ala Leu Thr Pro Ala Ala Asp Ser Pro Ala Ser Arg Ser Gly Arg Asn
Asp Asp Ser Phe Ala Ala Ala Ser Pro Pro Leu Trp Asp Ala Lys Thr
Gly Pro Ser Pro Gly Asn Ala Ser Ile Glu Trp Ala Gln Ser Ala Arg
Arg Arg Tyr Ala Glu Gly Met Asp Asn Leu Tyr Arg Arg Gly Ile Thr
Asp Thr Met Leu Val Arg Phe Tyr Val Gly Trp Asn Gly Arg Thr Leu
Phe His Pro Val Arg Asp Ser Ala Gly Asn Ile Val Glu Val Ser Leu
Gly Glu Asn Val Arg Leu His Pro Asp Asp Pro Glu Gly Val Arg Val
Ile Arg Ala Ala Ile Glu Arg His Ser Val Lys Ala Asp Arg Leu Ala
Ala Ser His Gly Tyr Gly Gly Glu Met Asp Asp His Val Tyr Ser Arg
Asn Ala Tyr Ser Arg Asp Asp Gly Arg Tyr Thr Ala Gln Arg Arg Asp
Pro Pro Val Val Pro Ser Asn Gly Arg Phe Ser Met Arg Ser Pro Ala
Thr Ile Pro Ser Gln Arg Leu Gly Ser Asp Arg Asp Tyr Glu Arg Glu
Arg Glu Arg Asp Gly Asp Leu His Asp Ala Arg Asp Gly Arg Asp Gly
Arg Tyr Gly Asp Ser Leu Arg Ser Pro Ala Ala Pro Val Ala Ala Met
Thr Ala Pro Gly Ala Leu Asp Thr Ser Pro Ala Leu Arg Thr Asn Leu
Ala Arg Glu Val Val Pro Thr Tyr Ala Arg Ser Ser Ala Asn Ala Ser
Ala Thr Thr Ser Pro Tyr Thr Gly Ala Ala Ser Thr Tyr Ser Ile Tyr
Ser Ala Ser Asp Arg Ala Ala Ser Tyr Pro Val Gly Arg Ser Ser Ile
Ser Gln Ala Asp Leu Asp Gly Asn Arg Gly Gly Pro Pro Pro Met Ala
Met Tyr Ala Ser Ala Lys Ala Glu Pro Val Ala Asn Gly Ser Thr Phe
Ser Ala Leu Asp Pro Ala Met Met Ala Asp Asp Ala Ala Gly Gln Ile
Asp Pro Asn Leu Thr Ser Ser Pro Val Leu Ala Ser Asn Ser Ala Val
Pro Ala Pro Ser Thr Ala Pro Ala Ala Ala His Gly Val Arg Ser Glu
Thr Arg Ser Arg Pro Pro Ser Ala Gly Asn Glu Val Ala His Glu Ala
Gly Ser Ala Lys Ala Pro Pro Gly Ala Pro Ser Gly Gly His Ser Gly
Glu Ile Lys Glu His Asn Pro Asp Glu His Glu Leu Glu Ser Val Arg
Gln Gln Ala Arg Gln Met Ala Arg Lys Met Arg Pro Asp Ala Ser Glu
Ala Asp Ile Glu Arg Leu Val Gln Asn Phe Ile Gly Gly Gly Glu Ser
Lys

We claim:
 1. A nucleic acid fragment from the fungus Ustilago maydis, which comprises the XbaI-BglII fragment depicted in FIG. 1.
 2. A nucleic acid fragment as claimed in claim 1, which is capable of functional complementation of an Ustilago maydis mutant which constitutively expresses egl1.
 3. A nucleic acid fragment which hybridizes under standard conditions with a nucleic acid fragment as claimed in claim 1 and which is capable of functional complementation of an Ustilago maydis mutant which constitutively expresses egl1.
 4. A method of identifying a potential fungicide, which method comprises bringing a haploid strain of Ustilago maydis, which strain comprises the XbaI-BglII nucleic acid fragment defined in claim 1, and which does not express endoglucanase egl1, into contact with a compound which is to be tested for fungicidal action; and measuring the amount of endoglucanase egl1 expressed by the haploid strain; wherein an increased expression of egl1 indicates that the tested compound is a fungicide.
 5. The method of claim 4, wherein the site of action of the compound which is to be tested for fungicidal action is a gene product of the XbaI-BglII nucleic acid fragment.
 6. The method of claim 4, wherein the site of action of the compound which is to be tested for fungicidal action is a gene product of the XbaI-BglII nucleic acid fragment which is represented by the amino acid sequence depicted in SEQ ID NO:2.
 7. A method of producing a gall in a corn plant, wherein said gall lacks black discoloration, which method comprises infecting the corn plant with a strain of Ustilago maydis which harbors the nucleic acid fragment defined in claim 1; waiting for a gall lacking black discoloration to be formed; and recovering the formed gall.
 8. A nucleic acid fragment as claimed in claim 1, having the nucleic acid sequence indicated in SEQ ID NO:1 between the BglII and the XbaI cleavage site.
 9. The gene product of the nucleic acid fragment as claimed in claim 8, which is represented by the amino acid sequence depicted in SEQ ID NO:2.
 10. A food product made from the galls of a corn plant, wherein said galls are caused by a strain of Ustilago maydis which harbors the nucleic acid fragment defined in claim 1.